


seq_documentation_block : 



LOCUS CELK09H9 37881 bp DNA INV 22-JAN-1998
DEFINITION Caenorhabditis elegans cosmid K09H9.
ACCESSION AF043700
VERSION AF043700.1 GI:2804458
KEYWORDS
SOURCE Caenorhabditis elegans strain=Bristol N2.
ORGANISM Caenorhabditis elegans
Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida;
Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis.
REFERENCE 1 (bases 1 to 37881)
AUTHORS Wilson,R., Ainscough,R., Anderson,K., Baynes,C., Berks,M.,
Bonfield,J., Burton,J., Connell,M., Copsey,T., Cooper,J.,
Coulson,A., Craxton,M., Dear,S., Du,Z., Durbin,R., Favello,A.,
Fulton,L., Gardner,A., Green,P., Hawkins,T., Hillier,L., Jier,M.,
Johnston,L., Jones,M., Kershaw,J., Kirsten,J., Laister,N.,
Latreille,P., Lightning,J., Lloyd,C., McMurray,A., Mortimore,B.,
O'Callaghan,M., Parsons,J., Percy,C., Rifken,L., Roopra,A.,
Saunders,D., Shownkeen,R., Smaldon,N., Smith,A., Sonnhammer,E.,
Staden,R., Sulston,J., Thierry-Mieg,J., Thomas,K., Vaudin,M.,
Vaughan,K., Waterston,R., Watson,A., Weinstock,L.,
Wilkinson-Sproat,J. and Wohldman,P.
TITLE 2.2 Mb of contiguous nucleotide sequence from chromosome III of C.
elegans
JOURNAL Nature 368 (6466), 32-38 (1994)
MEDLINE 94150718
REFERENCE 2 (bases 1 to 37881)
AUTHORS Madsen,C., Graves,T. and Blair,T.
TITLE The sequence of C. elegans cosmid K09H9
JOURNAL Unpublished (1998)
REFERENCE 3 (bases 1 to 37881)
AUTHORS Waterston,R.
TITLE Direct Submission
JOURNAL Submitted (20-JAN-1998) Department of Genetics, Washington
University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA
COMMENT Submitted by:
Genome Sequencing Center
Department of Genetics, Washington University,
St. Louis, MO 63110, USA, and
Sanger Centre, Hinxton Hall
Cambridge CB10 1RQ, England
e-mail: rw@nematode.wustl.edu and jes@sanger.ac.uk

NOTICE: This sequence may not be the entire insert of this clone.
It may be shorter because we only sequence overlapping sections
once, or longer because we provide a small overlap between
neighboring submissions.

This sequence was finished as follows unless otherwise noted:
all regions were double stranded or sequenced with an alternate
chemistry; an attempt was made to resolve all sequencing problems,
such as compressions and repeats; all regions were covered by
sequence from more than one subclone

NEIGHBORING COSMID INFORMATION:
The 5' cosmid is C45E1, 200 bp overlap; 3' cosmid is Y54E10. Actual
start of this cosmid is at base position 197 of CELK09H9; actual
end is at 37881 of CELK09H9.

NOTES:
Coding sequences below are predicted from computer analysis, using
the program Genefinder (P. Green and L. Hillier, ms in preparation)
FEATURES Location/Qualifiers 
source 1..37881
/organism="Caenorhabditis elegans"
/strain="Bristol N2"
/db_xref="taxon:6239"
/chromosome="I"
/clone="K09H9"

gene complement(11887..13324)
/gene="K09H9.4"

CDS complement(join(11887..12723,13250..13324))
/gene="K09H9.4"
/note="similar to reverse transcriptase (Pfam: rvt, score:
10.85)"
/codon_start=1
/evidence=not_experimental
/protein_id="AAB97571.1"
/db_xref="GI:2804461"
/translation="MFAVCFSIFICFFLSSFLFLTLVNESFFKPLLFGIYVNDISSIL
PEKVACKQFADDTKLYVSTPKAETENNLQAALDAWDWTRGAKLTLSQSKTVHVTVGR 
CRKDFKYHLDGYPIERKTITRDLGFLISEKLDFSEHWKKSINLAKFQLANIFNQYSTS 
NKKLMILLYKTFIRPRLEYGTWSSPTKKSDEKAIESVQNAFTRRLYSRIKKRYINPT 
DKDYKTSMERNELFNLQTLSTRRKIIDRIQVIRMNTGKVDLKTSEFFEQQETHTRSKK 
KYVWRTGKTKLRRHFFVNRTLSTMKQH"

gene 15300..16502
/gene="K09H9.3"

CDS join(15300..15558,15910..16502)
/gene="K09H9.3"
/note="Similar to cuticular collagen"
/codon_start=1
/evidence=not_experimental
/protein_id="AAB97570.1"
/db_xref="GI:2804460"

/translation="MWKFVIGSVSTAAFFVSVCTIYFSVSMLDELDSFRLSIRDELED 

WKEVSDDTWQRLNDMTSRNVPKKTNILKEFVRGKRNVGNDQCNCAEPTKNCPAGPPGE 

KGSLGNPGQPGPDGVDGDNGVDGDWIHDMPNPKECIKCPAGPPGPPGPPGPLGPRGD 

KGPSGPRGALGDQGETGPVGEIGDQGPPGSAGRAGPRGQAGQPGTIAIVGLAGRPGPQ 

GPLGEPGAQGEPGVDGKDGALGAPGRKAENGRPGKRGKDGVAGVPGTRGKEGEDAGYC 
TCPPRTA" 

gene complement(17417..21259)
/gene="K09H9.5"

CDS complement(join(17417..17689,17939..18115,18335..18665,
19906..19976,20133..20192,20243..20356,21065..21259))
/gene="K09H9.5"
/note="contains similarity to S. cerevisiae SSF1
(SW:P38789) and SSF2 (SW:Q12153)"
/codon_start=1
/evidence=not_experimental
/protein_id="AAB97572.1"
/db_xref="GI:2804462"

/translation="MKILTVFLLVAISANCATPCDHQDAVKSMECKPLLGQLAGEMAK 
HADIPPPDELKEFSLLCEQTLACMEQVKCDLLKNATGLIAKTCTGIKLMSGPFGQCID 
DLRSTPPSLKKYPCGRFLKTEKGRPGNCQMYQDELECTKHLTSDRFTMVLFFDKVPIK 
EYFTKLFNFYVDFQAENPKYRCVFGKCHWVSFYFFQKFLRVKYGALHNFYYVKFSYH 
KILKCAWNQVYCFDAPCLKFVYFFKFARKKWSLACFHELNASKVLLLLEIFLVIPIYM 
LFLFPWWLMWIGFHLVLILVTIYALRKKKHRFMWPMVLFTLSQFLIWGLLTVIQLLIA 

FFDTQSFLNFYSQGHHEEFFEKALWIWKLLVFAIGAFLFWRFSVYYATKNYYSDRL 
EGQVSLTSTVILLQ" 
gene complement(22612..25875)
/gene="K09H9.6"

CDS complement(join(22612..23071,23125..23648,23857..23899,
24184..24309,24361..24437,25223..25404,25461..25875))
/gene="K09H9.6"
/note="contains similarity to human RNA-binding protein
FUS/TLS (SW:Q28009)"
/codon_start=1
/evidence=not_experimental
/protein_id="AAB97575.1"
/db_xref="GI:2804465"
/translation="MKMKKGKKQRKSGAHNRGNVDALSQKDALYHQEKFVKKLQKQKF
LENKEIELARQPHCLVIHRGDVGKYVKGLESDLRNLVEPNTAKNLKILKRNNIKDFIV 
NGAVI/SVTNMMVLTSSDASLQLRMMRFSQGPTLSFKVKQYSIiARHVVNCQKRPVATDK 
LFKSSPLWMNGFGDGTQKHLSLVQTFIQNMFPSINVDTIQLGNLKRCLIVSYDEETD 
EIQMRHFAIRWASGLNKSVKKLMQAEKTMGKNIPNLSTYKDISDYFLNPGQFRIRTK 
LNFSNINHKIIIKKYSIYLNFPSFSSPGQLSDSEFEGDQQEVELPQDISEGRGCGVGQ 
KSNVRLHEIGPRLTLELVKIEEGIDEGEVLYHKHNAKTPDELIKLRAHMDKKKQMKKR 



REQESEQRVIRRLTIVKEQQDAEEAEVKAIRENAARKQAAATGQVEEVENQKEKDREI 
AMNRERDLKRANEEWGTSEASKRPRYEDSRGGFRGGFRGRGEDRGGFRGRGGDRGGFR 
GRDRDGGGFRGRSVDRGGFRGGGGDRGGFRGRSSDRGGDRGGFRGRSGDRDGGFRGGF 
GGRGGGGFRGGDRGGFRGRGGGGGFRGGRGGDRGGGFRGGRR" 

misc_feature 26104..26105
/note="SL1 trans-splice site; see EST yk418f6.5"

gene 26108..28395
/gene="K09H9.2"

CDS join(26108..26329,26385..26654,26729..27064,27249..27346,
27392..27515,27892..28395)
/gene="K09H9.2"
/note="coded for by C. elegans cDNA yk418f6.5; coded for
by C. elegans cDNA yk414d10.5; coded for by C. elegans
cDNA yk418f6.3"
/codon_start=1
/protein_id="AAB97569.1"
/db_xref="GI:2804459"
/translation="MNRFVTRSKPTQPTKRVAPAPASGNEENIVAPEPQKTLEEPSVM
EKLQHAIREKDAQFTFDILSKNRPFDQAMNKTMTKMEFSPDYSDPSSEFMLLSVDESV 
IGAFSEGQSLTIRGDSTDDAVLCTDNATFPMKIIESATTVLLLHKCLEAPDSPTLPNF 
QVKTIGGKCFSTGELCPQVDVLSIGRLKDMLREQELRWDWKDREEEENLKGYKLRDLL 
NSVQMSVEEVKIALNDLPWKFSSGRYRYLSHKYRGEMLGLIVELLDEDSIGDVKLEK 
YFPEKKLNFSIFRCFFPTKIVKTENFSFSEVLLFFSVFSGKKTEFFNFSWFFKQKSK 
KNPEKHENSVFRSFFSISRASLRSNLPENVPDEVIDWFLNSRCEQNPEDPFTYRLPEI 
NLIRDLTWILYGTQKMPLQQFRELLHKILPYGVTMNIAQLEGVADISDAPFGKVISY 
LSPEDLPDTIKERMLYLFEYRKLWSMEQLRPYFKDVYKSKVAFDKYWQNCDYALSES 
NEMLYCGVR" 

gene complement(28869..30417)
/gene="K09H9.7"

CDS complement(join(28869..28914,28963..29124,29173..29257,
29731..29862,29998..30087,30393..30417))
/gene="K09H9.7"
/codon_start=1
/evidence=not_experimental
/protein_id="AAB97573.1"
/db_xref="GI:2804463"

/translation="MSTAVFANGNQRFPMNSTADSGTFVAKFNAWIDFIGSKKAFFTN 
LKILAENGIHESVIRALVLHNAKLPIPKCVATESSVEIYNSATISGIFSAFSVRCCKC 
SDNVTIPNLYKHMKQHVEKTRYSCSTCSGRFETEEAGLHHLLIHRRRRNVKLFDEWDG 
EAEKRIDQLLMRSVARKRD"

gene 30469..33659
/gene="K09H9.1"

CDS join(30469..30737,30856..30992,31134..31420,32381..32483,
32992..33659)
/gene="K09H9.1"
/note="similar to C. elegans PHA-1 (GB:X74228)"
/codon_start=1
/evidence=not_experimental
/protein_id="AAB97574.1"
/db_xref="GI:2804464"

/translation="MFTEHLSSELAEPLSPNFRNCETRGKELRPQRAVNIVSRGTREM 
SGNDITYPREHRLHLKEFLNSRVFAAQFLQFGKIFKKYTSKREMRQILAFSVDFLRSL 
LSLVNKCLYFYMERNNFLFFVEIGLSEDRSLASLNSIFEEISSNSQPSHHHTNDSITL 
PYDATIQKIAEKEKNLSKLLKELSIVDSSLEEEKASFEKTQSALLKEKGKIEKDKLML 
REEAQKQASMNAEMTTIVDSVFNNEAIVSRVLEKTSKNSISNLDLRVQLDCLGFGAIS 
PIARFDVLPSTQPQSMYILEEVQINVQFAKTLIDELDKDESLFDNFIANSQRIFPTKR 
VTVEFPMKYIQLSIGNLEKLIKKCMKMVSKEPQRNTTTIFRMHLQDANFSQSVNRVFN 
IILCDEIPEIFHSSENCKSDDTSDCWHLETIQPIPEQKANQFYGDDTILTGKCFCLKN 
LRLNSCINYQLLFEKQFLIQFFKEKTGFMRHFYDLVS"

alignment_scores : 

Quality: 38.00 Length: 7 

Ratio: 5.429 Gaps: 0 

Percent Similarity: 100.000 Percent Identity: 100.000 

alignment_block : 
US-09-403-269-7 x CELK09H9 

Align seg 1/1 to: CELK09H9 from: 1 to: 37881 



1 PheLeuSerGluGlnHisGly 7 






RESULT 3 

DRORCPA/c 

LOCUS DRORCPA 5619 bp mRNA INV 17-SEP-1999
DEFINITION Drosophila melanogaster ring canal proteins (kel) mRNA, complete
cds.
ACCESSION L08483
VERSION L08483.1 GI:577275
KEYWORDS intercellular bridge; ring canal protein.
SOURCE fruit fly.
ORGANISM Drosophila melanogaster
Eukaryota; Metazoa; Arthropoda; Tracheata; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila.
REFERENCE 1 (bases 1 to 5619)
AUTHORS Xue,F. and Cooley,L.
TITLE kelch encodes a component of intercellular bridges in Drosophila
egg chambers
JOURNAL Cell 72 (5), 681-693 (1993)
MEDLINE 93201592
PUBMED 8453663
REFERENCE 2 (bases 1 to 5619)
AUTHORS Robinson,D.N. and Cooley,L.
TITLE Examination of the function of two kelch proteins generated by stop
codon suppression
JOURNAL Development 124 (7), 1405-1417 (1997)
MEDLINE 97236487
PUBMED 9118811
REFERENCE 3 (bases 1 to 5619)
AUTHORS Cooley,L.
TITLE Direct Submission
JOURNAL Submitted (13-JAN-1993) Department of Genetics, Yale University
School of Medicine, New Haven, CT 06510, USA
COMMENT On Nov 28, 1994 this sequence version replaced gi:158234.
FEATURES Location/Qualifiers
source 1..5619
/organism="Drosophila melanogaster"
/db_xref="taxon:7227"
/dev_stage="0-4 hr embryos"
/tissue_lib="Nick Brown"

source 1..183
/organism="Drosophila melanogaster"
/db_xref="taxon:7227"
/tissue_type="0-4 hr embryos"
/tissue_lib="Nick Brown"

misc_feature 1..183
/gene="kel"
/note="sequence determined from genomic DNA clone"

gene 1..5619
/gene="kel"
/note="kelch"

CDS 279..4712
/gene="kel"
/note="ORF2; putative stop codon suppression by
selenocysteine incorporation"
/codon_start=1
/transl_except=(pos:2346..2348,aa:OTHER)
/product="ring canal protein"
/protein_id="AAA53472.2"
/db_xref="GI:5902659"

/translation="MIALSALLTKYTIGIMSNLSNGNSNNNNQQQQQQQQGQNPQQPA 
QNEGGAGAEFVAPPPGLGAAVGVAAMQQRNRLLQQQQQQHHHHQNPAAEGSGLERGSC
LLRYASQNSLDESSQKHVQRPNGKERGTVGQYSNEQHTARSFDAMNEMRKQKQLCDVI 
LVADDVEIHAHRMVLASCSPYFYAMFTSFEESRQARITLQSVDARALELLIDYVYTAT
VEVNEDNVQVLLTAANLLQLTDVRDACCDFLQTQLDASNCLGIREFADIHACVELLNY
AETYIEQHFNEVIQFDEFLNLSHEQVISLIGNDRISVPNEERVYECVIAWLRYDVPMR 
EQFTSLLMEHVRLPFLSKEYITQRVDKEILLEGNIVCKNLIIEALTYHLLPTETKSAR 
TVPRKPVGMPKILLVIGGQAPKAIRSVEWYDLREEKWYQAAEMPNRRCRSGLSVLGDK 
VYAVGGFNGSLRVRTVDVYDPATDQWANCSNMEARRSTLGVAALNGCIYAVGGFDGTT
GLSSAEMYDPKTDIWRFIASMSTRRSSVGVGWHGLLYAVGGYDGFTRQCLSSVERYN 
PDTDTWVNVAEMSSRRSGAGVGVLNNILYRVGGHDGPMVRRSVEAYDCETNSWRSVAD
MSYCRRNAGWAHDGLLYWGGDDGTSNLASVEVYCPDSDSWRILPALMTIGRSYAGV
CMIDKPMXMEEQGALARQAASLAIALLDDENSQAEGTMEGAIGGAIYGNLAPAGGAAA
AAAPAAPAQAPQPNHPHYENIYAPIGQPSNNNNNSGSNSNQAAAIANANAPANAEEIQ 
QQQQPAPTEPNANNNPQPPTAAAPALSQQQQQQQAQPQQPQRILPMNNYRNDLYDRSA 
ADGVCSAYDVPRAVRSGLGYRRNFRIDMQNGNRCGSGLRCTPLYTNSRSNCQRQRSFD 
DTESTDGYNLPYAGAGTMRYENIYEQIRDEPLYRTSAANRVPLYTRLDVLGHGIGRIE 
RHLSSSCGNIDHYNLGGHYAVLGHSHFGTVGHIRLNANGSGVAAPGVAGTGTCNVPNC 
QGYMTAAGSTVPVEYANVKVPVKNSASSFFSCLHGENSQSMTNIYKTSGTAARMAGHN 
SPLTPNVSMERASRSASAGAAGSAAAAVEEHSAADSIPSSSNINANRTTGAIPKVKTA
NKPAKESGGSSTAASPILDKTTSTGSGKSVTLAKKTSTAAARSSSSGDTNGNGTLNRI 
SKSSLQWLLVNKWLPLWIGQGPDCKVIDFNFMFSRDCVSCDTASVASQMSNPYGTPRL 
SGLPQDMVRFQSSCAGACAAAGAASTIRRDANASARPLHSTLSRLRNGEKRNPNRVAG
NYQYEDPSYENVHVQWQNGFEFGRSRDYDPNSTYHQQRPLLQRARSESPTFSNQQRRL 
QRQGAQAQQQSQQPKPPGSPDPYKNYKLNADNNTFKPKPIAADELEGAVGGAVAEIAL 
PEVDIEWDPVSLSDNETETTSSQNNLPSTTNSNNLNEHND" 



CDS 279..2348
/gene="kel"
/note="ORF1"
/codon_start=1
/product="ring canal protein"
/protein_id="AAA53471.1"
/db_xref="GI:577276"
/translation="MIALSALLTKYTIGIMSNLSNGNSNNNNQQQQQQQQGQNPQQPA
QNEGGAGAEFVAPPPGLGAAVGVAAMQQRNRLLQQQQQQHHHHQNPAAEGSGLERGSC
LLRYASQNSLDESSQKHVQRPNGKERGTVGQYSNEQHTARSFDAMNEMRKQKQLCDVI
LVADDVEIHAHRMVLASCSPYFYAMFTSFEESRQARITLQSVDARALELLIDYVYTAT
VEVNEDNVQVLLTAANLLQLTDVRDACCDFLQTQLDASNCLGIREFADIHACVELLNY
AETYIEQHFNEVIQFDEFLNLSHEQVISLIGNDRISVPNEERVYECVIAWLRYDVPMR
EQFTSLLMEHVRLPFLSKEYITQRVDKEILLEGNIVCKNLIIEALTYHLLPTETKSAR
TVPRKPVGMPKILLVIGGQAPKAIRSVEWYDLREEKWYQAAEMPNRRCRSGLSVLGDK
VYAVGGFNGSLRVRTVDVYDPATDQWANCSNMEARRSTLGVAALNGCIYAVGGFDGTT
GLSSAEMYDPKTDIWRFIASMSTRRSSVGVGWHGLLYAVGGYDGFTRQCLSSVERYN
PDTDTWVNVAEMSSRRSGAGVGVLNNILYRVGGHDGPMVRRSVEAYDCETNSWRSVAD
MSYCRRNAGWAHDGLLYWGGDDGTSNLASVEVYCPDSDSWRILPALMTIGRSYAGV
CMIDKPM"

terminator 2346..2348
/gene="kel"
/evidence=experimental


BASE COUNT 1523 a 1405 c 1394 g 1297 t
ORIGIN



Query Match 70.4%; Score 17.6; DB 3; Length 5619;
Best Local Similarity 68.0%; Pred. No. 2.7e+02;
Matches 17; Conservative 3; Mismatches 5; Indels 0; Gaps 0;



Qy    1 ccgaattcaargcnatgytnccnyt 25

Db 4306 CCGAATTCAAAACCATTCTGCCACT 4282




